A Hybrid Feature Selection Optimization Model for High Dimension Data Classification

نویسندگان

چکیده

Feature selection is an NP-hard combinatorial problem, in which the number of possible feature subsets increases exponentially with features. In case large dimensionality, goal to determine smallest features considering most informative subset. this paper, we proposed a hybrid optimization model for Cancer Classification called, ENSVM. Our based on using Elastic Net (EN) method that regulates and selects variables gene genomic microarray data. We applied three different techniques namely Social Ski-Driver (SSD), Randomized SearchCV (RS) NetCV (ENCV) determining traditional Support Vector Machines classification. To evaluate model, compared results applying ENSVM seven data SSD-SVM SVM (RBF) kernel without any method. The comparison revealed effect selecting optimal subset maximized classification performance. Accordingly, minimizing significant when analyzing high dimensional performance nevertheless accuracy. Moreover, superior model.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

H-BwoaSvm: A Hybrid Model for Classification and Feature Selection of Mammography Screening Behavior Data

Breast cancer is one of the most common cancer in the world. Early detection of cancers cause significantly reduce in morbidity rate and treatment costs. Mammography is a known effective diagnosis method of breast cancer. A way for mammography screening behavior identification is women's awareness evaluation for participating in mammography screening programs. Todays, intelligence systems could...

متن کامل

A Classification Method for E-mail Spam Using a Hybrid Approach for Feature Selection Optimization

Spam is an unwanted email that is harmful to communications around the world. Spam leads to a growing problem in a personal email, so it would be essential to detect it. Machine learning is very useful to solve this problem as it shows good results in order to learn all the requisite patterns for classification due to its adaptive existence. Nonetheless, in spam detection, there are a large num...

متن کامل

Feature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach

Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...

متن کامل

a hybrid feature subset selection algorithm for analysis of high correlation proteomic data

pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. the surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (seldi-tof ms) has been used to generate proteomic profiles from biological fluids. mass spectrometry yields redundant noisy data that the most data points are irrelevant features ...

متن کامل

Optimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines

In this paper, principles and existing feature selection methods for classifying and clustering data be introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3065341